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We consider a tournament among four equally strong semifinalists. The players have to decide 
how much stamina to use in the semifinals, provided that the rest is available in the final and 
the third-place playoff. We investigate optimal strategies for allocating stamina to the successive 
matches when players’ prizes (payoffs) are given according to the tournament results. From the 
basic assumption that the probability to win a match follows a nondecreasing function of stamina 
difference, we present symmetric Nash equilibria for general payoff structures. We find three different 
phases of the Nash equilibria in the payoff space. First, when the champion wins a much bigger payoff 
than the others, any pure strategy can constitute a Nash equilibrium as long as all four players adopt 
it in common. Second, when the first two places are much more valuable than the other two, the only 
Nash equilibrium is such that everyone uses a pure strategy investing all stamina in the semifinal. 
Third, when the payoff for last place is much smaller than the others, a Nash equilibrium is formed 
when every player adopts a mixed strategy of using all or none of its stamina in the semifinals. In 
a limiting case that only last place pays the penalty, this mixed-strategy profile can be proved to 
be a unique symmetric Nash equilibrium, at least when the winning probability follows a Heaviside 
step function. Moreover, by using this Heaviside step function, we study the tournament by using 
evolutionary replicator dynamics to obtain analytic solutions, which reproduces the corresponding 
Nash equilibria on the population level and gives information on dynamic aspects. 


I. INTRODUCTION 

During the 1938 FIFA World Cup in France, Adhemar 
Pimenta, the coach of the Brazilian national team, was 
facing a dilemma: Spearheaded by Leonidas da Silva, 
the Brazilians had defeated the Polish in extra time and 
consecutively eliminated Czechoslovakia in a replay after 
the “Battle of Bordeaux.” Pimenta was becoming wor¬ 
ried about the fatigue accumulation of his team members. 
The next opponent in the semifinal was Italy, the reign¬ 
ing world champion at that time. Worse was that the 
final with Hungary would not be easier. After chang¬ 
ing the starting lineup eight times, the coach decided to 
play the semifinal against Italy without Leonidas. Then 
Brazil lost to Italy. 

One might say retrospectively that the coach was im¬ 
prudent. However, it is not a trivial question what would 
have been a better choice in that situation, especially for 
the person directly involved. Let us look back at their 
next encounter during the 1970 World Cup in Mexico. 
This time, Italy advanced to the final after the “Game of 
the Century” against Beckenbauer’s Germany. However, 
the energy exhausted in the semifinal turned out to be 
such a great loss that the Azzurri was utterly defeated 
by the Brazilians in the final. So they had to watch help¬ 
lessly as Brazil took permanent ownership of the Jules 
Rimet Trophy. 

This kind of dilemma is by no means rare in tourna- 
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ment competitions. It might have originated from the un¬ 
fairness of sports tournament compared to sports leagues, 
which is traded off against the efficiency of tournament 
competitions M. How to distribute stamina over a 
series of matches takes up an important part of the play¬ 
ers’ strategies. We also point out that the dilemma cap¬ 
tures an aspect of our society as a series of competitions, 
which is an essential part of our daily life from television 
shows like Project Runway to presidential elections with 
the two-round system. A sports tournament serves as 
a striking metaphor here, as clearly seen from the fact 
that people often talk about fair play, front-runners and 
a knock-out punch in these activities as well. In order to 
investigate this dilemma, we consider a simple model for 
a two-round tournament among four players with equal 
stamina. The players have to decide how much stamina 
to use in the first round (i.e., semifinals) provided that 
only the rest is available in the second round. The sec¬ 
ond rounds take place between winners (losers) of the 
first round for the championship (third place). We as¬ 
sume that the outcome of a match is described by a well- 
defined probability function. This function should con¬ 
tain all the essential information of the sport under con¬ 
sideration, such as the rules to decide the winner (for ad¬ 
ditional discussions about the dynamics of competitions 
see Refs. [1,@.) It gives the probability for a player to de¬ 
feat an opponent as a function of their invested stamina, 
so it will be called the winning probability function Q. 
The chance is 50:50 when two players spend the same 
amount of stamina and it is reasonable that the winning 
probability is a nondecreasing function of the stamina 
difference. At the end of the tournament, each player 
gets a prize (payoff) for the finishing place. We consider 



2 


a general payoff structure under a plausible constraint 
that the payoff never decreases as a player moves up in 
rank. 

Having defined the game by choosing the winning prob¬ 
ability function together with the payoffs, we have to ask 
ourselves what we mean by solving this game. In game 
theory, the Nash equilibrium is the most well-known so¬ 
lution concept: Once it is achieved, players cannot be 
better off by changing their own strategy alone @ . One 
may also investigate the game from an evolutionary point 
of view. The replicator dyiramics is widely used to study 
the evolution of an infinite population with pure strate¬ 
gies [Ml. In this study, we apply both methods to our 
tournament model for a better understanding and com¬ 
pare the results to check their consistency. 

In terms of Nash equilibrium, we find three different 
phases, which divide the payoff space into three regions, 
(i) First, when the champion wins a much bigger pay¬ 
off than the others, any pure strategy can constitute a 
Nash equilibrium as long as all four players adopt it in 
common, (ii) If the payoff for the second-place winner 
is also valuable enough, one has no reason to fight in 
the second round. Therefore, the only Nash equilibrium 
is such that everyone spends all the stamina in the first 
round, i.e, semifinals, (iii) Finally, if the payoff for last 
place is much smaller than the others, a Nash equilibrium 
emerges when everyone adopts a mixed strategy using all 
or none of the stamina with proper weights. 

The above three phases are also analytically tractable 
from the viewpoint of evolutionary dynamics when the 
winning probability function is the Heaviside step func¬ 
tion of the stamina difference between two players. The 
results of the replicator dynamics are as follows. For case 
(i), the population evolves to a single species adopting a 
common pure strategy. Which pure strategy to adopt in 
the long run depends on the initial strategy distribution. 
For case (ii), the population evolves to a single species 
using all the stamina in the first round. For case (iii), 
the population becomes a genetic polymorphism of two 
species. One spends all the stamina in the first round, 
while the other reserves it all for the second round. The 
proportions of these species correspond to the weights of 
the mixed-strategy Nash equilibrium in case (iii). These 
solutions are fully consistent with the analysis of Nash 
equilibria. 

This paper is organized as follows. In the next section 
we define our game of the two-round tournament in detail 
by specifying its strategy space and payoff structure. In 
Sec. nni we focus on symmetric Nash equilibria of this 
game. We begin with simple limiting payoff structures 
and then proceed to a general payoff structure to find 
Nash equilibria for the entire payoff space. This analysis 
is followed in Sec. llVI bv the evolutionary dynamics of an 
infinite population with pure strategies and a comparison 
of the results with the Nash equilibria. 


II. MODEL 

Let us consider four equally strong semifinalists and 
denote them by A, B, C, and D, respectively. In the 
first round, player A meets B, while C meets D. The 
second round takes place between winners (for the cham¬ 
pionship) and between losers (for third place) of the first 
round. The players’ payoffs are given according to the 
tournament result. The numerical values of the payoffs 
are denoted by s for the champion, u for the second-place 
winner, v for the third-place winner, and w for last place, 
where w < v < u < s. Nash equilibria and replicator dy¬ 
namics are invariant under translation and rescaling of 
the payoffs by a positive factor, so we may set w = 0 and 
s = 1 without loss of generality. Then the payoff param¬ 
eter space reduces to the (m, v) plane with 0 < u < u < 1. 

Each player’s strategy determines how much stamina 
will be spent in the first round provided that the rest 
is available in the second round. We assume that all the 
players have an equal amount of stamina at the beginning 
and normalize it as 1. Then player z’s strategy, which 
is generally a mixed one, is expressed by a normalized 
distribution function of the mixing weights over a 

closed interval [0,1], i.e., cj)i{x)dx = 1. If the player 
adopts a pure strategy of investing Xi in the first round, 
for example, (j)i{x) = S{x — xt). As a slight abuse of 
notation, we will often abbreviate such a pure strategy 
as Xi- 

Suppose that the players have formed a strategy pro¬ 
file [3 {(j)A,4’BT4 ’Ct4’d)- In order to calculate ex¬ 
pected payoffs, we need the winning probability function 
f{xi,Xj). It tells us how likely player i is to defeat player 
j when they use stamina Xi and Xj, respectively. We 
furthermore assume that / is a nondecreasing function 
of the stamina difference Ax^ = Xi — xj € [—1,1], i.e., 
f(xi,Xj) = f{xi — Xj) = f{Axij). This implies that 
a player’s winning probability never decreases as that 
player’s investment increases. We do not consider a draw 
as a match outcome and require /(Ax^) -I- f{Axji) = 1. 
Then we have /(O) = i, i.e., two players spending the 
same amount of stamina have an equal chance to win 
the match. It is convenient to consider only the relative 
difference from /(O) by defining h{Ax) = /(Ax) — /(O), 
which is an odd function, because h{Ax) + h{—Ax) = 0. 
It is non-negative for Ax > 0 and has a maximum at 
Ax = 1. The maximum is bounded by ^ from above, 
because /(Ax) = h{Ax) + ^ < 1- 

To demonstrate how to calculate the expected payoffs, 
let us assume that the players’ moves are xa,xb,xc, and 
XD^ respectively. For player A, the probability to win the 
semifinal is defined as ^ -I- /iab, where hij = h{xi — Xj). 
If player A has really made it, the remaining stamina for 
the final must be 1 — xa- Then, who will be A’s next 
opponent in the final? With probability ^ + he d, it is 
player C with remaining stamina 1 — xc- Player A will 
defeat C with probability i -I- h[{l — xa) — (1 — xc)] = 
5 — hAC- Alternatively, the opponent can be player D 
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with probability ^ ~ he d , and player A will defeat D with 
probability ^ — hAD- Therefore, we find the probability 
of A to be the champion as 


— + hAB 


^ + hcD 



+ 




( 1 ) 


It is straightforward to find the probability to take second 
or third place as well. Then player A’s expected payoff 
TTA for these particular moves xa, xb, xc, and is 
expressed as 


TTA '■ 


a 


- hAB 




+ 








- hcA (i 


- r) A r-\ 


+ 



- + hAD 


+v 




hcD 



( 2 ) 


This should be averaged over the distribution functions 
(jTAixA), (t>B(.XB), (t)c{xc), and (t>D{xD) to yield the even¬ 
tual expected payoff for player A. The other players’ 
payoffs are readily obtained by permuting the indices. 

Note that our model is similar to the war of attrition 
(WA) in that each strategy is defined on a continuous 
space, leading to an infinite-sized payoff matrix. In ad¬ 
dition, the WA bears a strong similarity to the chicken 
game, as ours does when it matters not to be the loser. 
One might even think of the semifinalists’ dilemma as a 
variant of this famous game, where each player’s remain¬ 
ing resources after the bid become as much important as 
the victory itself. Obviously, such a variation would just 
reflect the existence of the second round in our case. In 
addition, this modification will induce a player to give 
up the first round more easily whenever the opponent 
appears aggressive enough. Beyond a qualitative level, 
however, it is hard to predict the behavior, like the ratio 
between investing 0% and 100% in the first round, by 
this analogy. At the same time, we point out a subtle 
difference of our game in that it is essentially one’s own 
bid in the first round that determines one’s remaining re¬ 
sources, whereas it would rather be one’s first opponent’s 

bid in the WA I, Em- 


Ill. NASH EQUILIBRIA 


the champion who has won both rounds gets a prize, 
so it can be regarded as a winner-take-all system. We 
will show that there exists an infinite number of Nash 
equilibria of pure strategies where the players coordinate 
their strategies by choosing the same value. If someone 
spends less in the first round than this coordination, that 
player becomes more likely to lose the first match. If 
someone spends more, on the other hand, that player is 
risking the chance to win the second round. 

We can argue this statement more rigorously by taking 
player A’s viewpoint. If A uses a pure strategy xa while 
the other three use a common pure strategy x*, then 
hcD = 0 and hAB = hAc = hAD = h{xA - x*) = h. 
Equation m thus reduces to 

tta{xa, x*,x*,x*) = - — < 7ta{x*,x*,x*, x*) = 

(3) 

Therefore, the pure-strategy profile S* = {x*, x*, x*, x*) 
is a Nash equilibrium for any a;* G [0 1] if u = = 0. 


A. Region I 

Now let us check how robust the Nash equilibrium S* is 
for a general payoff structure. Player A’s expected payoff 
of Eq. ([5]) becomes 7rA{x*,x*,x*,x*) = |(1 -1-14-1-1;) be¬ 
cause the total prize should be equally distributed among 
the four players. If player A uses xa while the other three 
use a common strategy x*, Eq. is simplified to 



where h = h{xA — x*). For the strategy profile S* 
to be a Nash equilibrium, Att^ = tta{x* ,x*,x*,x*) — 
TTAixAjX*,x*,x*) > 0 for any xa G [0,1]. From Eq. (g]) 
we have 


Atta = h{v — u) + h?{l — u — v) (5) 

= h[(l — h)v — (1 + h)u -b h], (6) 

First, consider the case of xa > x*, i.e., h > 0. The 
payoff difference Atta is non-negative only when 


V > 


1 + h 


1 - h 


u — 


h 

1 - h' 


(7) 


We have assumed u > v, so the inequality Eq. © auto¬ 
matically implies that 


u > 


1 h 


1 — h 


u — 


h 


( 8 ) 


Let us first consider a simple limiting case of the payoff 
structure by setting it = i; = 0. In other words, only 


which is satisfied only when u < ^. If this is the case, 
the right-hand side (RHS) of Eq. (O is a nonincreasing 
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function of h because 


d fl + h 


u — 


dh \ l — h I — h 


2m- 1 

(T^ 


< 0 . 


(9) 


Let Hq be the greatest lower bound of h{xA — x*) for 
XA > X*. If a certain payoff structure represented by 
(m, v) satisfies Eq. © for ft, = ho, it is guaranteed by 
Eq. (|9]) that Eq. 0 holds true for any ft > ftp. In other 
words, Atta remains non-negative for any xa > x* as 
long as 


I -|- fto 

V > -— u — 

I — fto 


1 — ftn 


( 10 ) 


If Xa < X*, on the other hand, we have ft < 0. Rewriting 
Eq. dS]) as Att^ = \{1 — u — v) -\- |ft|“^('u — u)], we see 

that it is always non-negative for v < u < i. 

Therefore, a strategy profile S* = {x*,x*, x*, x*) is a 
Nash equilibrium for any x* € [0,1] if the payoff structure 
(m, v) satisfies the inequality Eq. dlo}. We call this area 
region I in the (m, v) payoff space. This region is not 
observable if fto vanishes: For example, if Ax can be 
infinitesimally small and h{Ax) continuously approaches 
zero as Ax —^ 0, then S* remains as a Nash equilibrium 
only for 0 < u = u < i unless x* = 1. On the other 
hand, region I becomes the largest when fto = 5 , i.e., 
when /(Ax) is the Heaviside step function as follows: 


Region II is the set of (w, v) points at which Si is a Nash 
equilibrium but S* = (x*, x*, x*,x*) is not for x* < 1. If 
fti = ^, for example, this region is bounded by w < |(w + 
1), M < 3m — 1, and 0 < m < m < 1. We depict a graphical 
representation of regions I and II in Fig. [T] Note from 
Eqs. (fTOjl and (fl^ that the boundaries meet at (m, v) = 
(i, i) for any fto and fti in general. It is very interesting 
that the phase boundaries are determined only by both 
the extreme values fto and hi, and are independent of the 
shape of the function ft(Ax) for 0 < Ax < I. 



/(Ax) = 


Ax < 0 
Ax = 0 
Ax > 0. 


( 11 ) 


In this case, the region is bounded by m > 3 m — 1 and 
0 < M < M < 1 . 


B. Region II 

If X* = 1, it is impossible to have xa > x* and we only 
need to consider ft < 0. Then, Atta expressed in Eq. ® 
is always non-negative for m < 1 because both terms on 
the RHS are non-negative due to m < m. For m > 1, it is 

which tells us that Atta is 


better to work with Eq. 
non-negative if 


1 -I- ft 


1 - ft 


( 12 ) 


As shown in Eq. (I9|), if m > the RHS of Eq. (fl^ 
only grows as ft = h(xA — 1) increases. Therefore, once 
(m,m) satisfies Eq. (fl^ for the minimum of h{xA — 1), 
the inequality holds true for any xa- Due to the fact 
that ft(Ax) is a nondecreasing function, the minimum of 
h{xA — 1) should be ft(—I) = —h(l) = —hi. We thus 
conclude that the strategy profile = ( 1 , 1 , 1 , 1 ) is the 
only Nash equilibrium if 


1 — fti 

V < -—M ' 

1 -f- fti 


hi 



FIG. 1. (Color online) Schematic phase diagrams with a gen¬ 
eral payoff structure, where u is the relative payoff of the 
second-place winner and v is that of the third-place winner, 
for (a) fto > 0 and (b) fto = 0. The region of interest u > v 
is divided into three regions, denoted by I, II, and III, re¬ 
spectively. Region I is characterized by the strategy profile 
S* = {x*,x*,x*,x*) for any pure strategy x* £ [0,1] and 
bounded by Eq. GQl). However, as drawn in (b), it shrinks to 
a line segment of0<M = ii<|if minAa,>o ft(Ax) = 0. In 
region II bounded by Eq. dSl), S* is not a Nash equilibrium 
unless X* — 1. If u is large enough, the players may have a 
common mixed strategy as a Nash equilibrium in region III 
(see the text for details). 


1 -|- fti 


( 13 ) 
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C. Region III 


The remaining region is given by 


~ hi hi 

1 + hi^ I + /11’ 


(14) 


which is referred to as region III. We assume hi > 0 be¬ 
cause hi = 0 would mean that each player’s expected 
payoff is trivially j{l + u + v) irrespective of one’s strat¬ 
egy. Let us consider a simple limiting payoff structure 
again by setting u = v = \. This is disadvantageous 
only for last place, so we may call it a loser-pay-all as 
a rhyme for the winner-take-all system given above. If 
only the one that has lost both matches gets nothing, the 
immediate goal should be to win at least one match. The 
important keyword here is concentration. It means that 
it is advisable to concentrate either on the first round 
or on the second round. For this reason, we look for a 
mixed-strategy Nash equilibrium in the form 


nip = p6{x) + {1 — p)S{l — x) (15) 

with p € [0,1] and indeed find a strategy profile Sm = 
{rup* ,mp» ^rup* ,mp») as a Nash equilibrium, where 


to Eq. (flBIl as {u, v) —>• (1,1). It is also worth noting that 
Eq. (fT71) vanishes, i.e., p* —0, so Sm converges to as 
V approaches the boundary given in Eq. (HI. 

Interestingly, Sm can be shown to be the only sym¬ 
metric Nash equilibrium for u = v = 1, when the win¬ 
ning probability is taken as the Heaviside step function 
[Eq. (fTTl) ]. Suppose that every player is using a mixed 
strategy m{x) given by 

TO(a;) = p(5(a;)-I-9(5(1 — a;)-I-g(a;), (19) 


where p and q are probabilities of using a; = 0 and a; = 1, 
respectively, and g{x) is non-negative on (0,1). The con¬ 
servation of total probability requires that g{x)dx = 
\ — p — q>Q. Noting that player H’s initial payoff must 
be I due to symmetry, we will check whether A can be 
better off by adopting rup*. When A has adopted nip *, 
whereas the others are still using m(x), player H’s payoff 
-KA can readily be calculated, because it depends on p 
and q but not on the shape of g{x), for nip* consisting of 
a; = 0 and 1 only. Straightforward calculation leads to 


TTA = 1 - 


8p-8p^-K2p3- (i + v^)(g3 

4(3 -f VZ) 



( 20 ) 


1 hi — hi 

^i 


(16) 


The proof is given in Appendix A. It is noteworthy that 
when /ii = i, the mixing weights p* and 1 — p* are 
and = 1 — respectively, where p = {1 + \/Z)l2 
is the golden ratio. This means that Sm is a Nash equi¬ 
librium if the weights of concentration between the first 
and second rounds are in the golden ratio. The condition 
that hi = ^ is often reasonable because one cannot win 
without making an effort, especially if the opponent is 
doing the best. 

We can repeat the same procedure for general pay¬ 
offs in region III. The above result is generalized by the 
finding that Sm = {nip*, nip*, nip*, nip*) is a Nash equi¬ 
librium, where 


p*{u,v) 


khi — I — i/k^hi + P — 2fc2 -I- 2k 
2khi 


(17) 


with k = 1 — u + V and I = 1 — u — v. In Appendix B we 
prove that 

TIA (x, nip *, nip *, nip *) < tta (nip *, nip *, mp *, nip *) 

= ^(l + w + ^^)( 18 ) 

for any x € [0,1] under the condition that h{Ax) > 0 for 
Ax > 0. The equality holds at both a; = 0 and x = 1. In 
other words, when player A adopts a concentration strat¬ 
egy in the form of Eq. dig, A’s payoff is predetermined 
as (1 -I- u + u)/4, independent of the choice of p. In that 
sense, nir,* can be considered as a partial equalizer strat¬ 
egy [1^, [1^ . One can readily check that Eq. dig reduces 


and we can show that this is always greater than | unless 
ni{x) = nip*. Therefore, any symmetric strategy profile 
cannot be sustained as an equilibrium except for Sm- 
It is also instructive to consider other strategy pro¬ 
files without such symmetry. Eor example, let us choose 
f{Ax) as 


/(Aa;) 


1 - for Ax > 0, 

^^Ax/T otherwise, 


where T is a parameter for controlling sensitivity to the 
difference of the stamina Q . This choice permits a strat¬ 
egy profile S'lo = (1,0,1,0) (and other combinations ex¬ 
changed between A and H, or C and D) to be a Nash 
equilibrium at {u,v) = (1,1), as shown in Appendix C. 
It is another possible manifestation of concentration, the 
keyword of region III, in addition to the mixed-strategy 
profile Sm- This seems to be parallel to the following 
asymmetric Nash equilibrium in the WA [1, i-®-; 

one player bids zero while the other does any number 
equal to or higher than the value of the resource in hand. 
We observe a similar situation in the chicken game as well 
because it has one symmetric mixed-strategy equilibrium 
plus two asymmetric pure-strategy equilibria where one 
player plays dove and the other plays hawk. There seems 
to be a class of games that permits both symmetric and 
symmetry-breaking solutions. 


IV. EVOLUTIONARY DYNAMICS 

In this section we consider replicator dynamics for the 
evolution of an infinite population with pure strategies. 
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When f{Ax) is chosen as Eq. (|2T]) with E —>■ 0, this evo¬ 
lutionary framework provides an alternative derivation 
or interpretation of the Nash equilibria considered in the 
previous section. The condition that T —^ 0 can be un¬ 
derstood as restricting our interest to a particular case 
of ho = hi = ^ [see, e.g., Eq. (fTTI) ]. 


A. Winner-take-all 


If M = r; = 0, only the final victory matters and there 
exists an infinite number of Nash equilibria where the 
players coordinate their strategies. We will review this 
result by introducing an evolutionary process governing 
an infinite population in which each individual has a pure 
strategy x. The distribution of x inside the population at 
time t is denoted by the probability density P{x; t) and 
assumed to be continuous at any finite t. We assume 
that the population evolves in such a way that successful 
strategies gradually increase their portions by replacing 
inferior ones, as is mathematically formulated as follows: 

^P{x;t) = [TT{x;t) - {TT)]P{x;t), (22) 

where Tr{x; t) means the payoff gained by strategy x at 
time t and (tt) = f 7r(x)P(x)dx is the average payoff of 
the population. For brevity, we will suppress the depen¬ 
dence on t henceforth. The evolutionary process as in 
Eq. (1^^ is called the replicator dynamics ©,[ 111 . Even 
on a continuous strategy space, the re plic ator dynam¬ 
ics is well defined with no modification [^. The initial 
population P(x; t = 0) is nonzero everywhere in the unit 
interval [0,1]. As mentioned above, we only consider a 
situation where /(As) is sharp enough to be approxi¬ 
mated as the Heaviside step function (CH) for mathemat¬ 
ical tractability. 

When u = V = 0, the expected payoff for player A 
by playing xa is equivalent to the average of Eq. o 
over P{xb), P{xc), and P{xd), which we will denote by 
tta{xa)- It is convenient to define 


c{x) = j dx'P{x')f{x 

Jo 



dx'P{x'), 


(23) 


and some algebra with integration by parts leads to 

TfA(xA) = 2 / dXBP{XB)fAB 

Jo 

X / dxc P{xc)fcA / dxDP{xD)fcD 

Jo I Jo 

= 2c{xa) / dxP{x)c{x) 

Jxa 


= c{xa) [l - c^{xa)] 


(24) 


where /y = f{xi — xj). Note that P{x) = ^c(s) with 
c(0) = 0 and c(l) = 1. One can also readily calculate the 


population average as 

/•i 1 

{Wa) = / dxc{x) [1 - c^(s)] P{x) = - (25) 

Jo a 

by an integration by parts again. The fact that (tta) = \ 
is a natural result of the symmetry among the four play¬ 
ers, i.e., equal probability to win on average. Therefore, 
the replicator dynamics reduces to 


= 


c(s) - c^[x) - - 


P{x). 


Equivalently, we may rewrite it as 





1 4 1 \ 

UJ 

dx 


4^ 4^/ 


(26) 


(27) 


which means that 


dc 




(28) 


where /(t) is a function of t only. Because c(s = 0;t) = 
0 and c(s = l;t) = 1 , /(t) simply turns out to be 0 . 
Therefore, we have 


f)f> 1 

- = --c(c-l)(c2 + c-l). (29) 


The RHS vanishes at ci = 0, C 2 = = (\/5 — l)/2 « 

0.618, and C 3 = 1. Plotting Eq. (1^ shows that dc/dt 
is negative between ci and C 2 , and positive between C 2 
and C 3 [see Fig. [21[a)]. In other words, C 2 is an unstable 
fixed point whereas the other two are stable. Therefore, 
if we consider x = x such that c(x) = the function 
c{x) converges to zero when x < x, whereas it converges 
to one when x > x, which means that a sharp peak in 
P{x) develops at x as t increases. Note that it is the cu¬ 
mulative probability that determines x, so the peak po¬ 
sition X depends on the choice of the initial distribution 
P{x;t = 0). In addition, Eq. (12^ provides an explicit 
example of mapping the replicator dynamics onto a re¬ 
action system, parametrized in terms of c{x) [H, [l^. To 
sum up, the replicator dynamics selects a single point x 
characterized by c(x) = as a refined solution among 
infinitely many possibilities. 


B. Loser-pay-all 

It is also straightforward to recast the case of u = v = 
1 into the evolutionary framework. Let La be player 
A’s probability to be last; we denote its average over 
P{xb), P{xc), and P{xd) as La- The gained payoff of 
A corresponds to the probability not to be last; it can 
be written as Wa = 1 — La- One can readily check that 
La = (1 — c)( 2 c — c^) and the corresponding replicator 
dynamics is given as 

^P(x) = - \La{x) - {La)] P{x)- 


(30) 
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FIG. 2. (Color online) Replicator dynamics for (a) u = v = 
0 and (b) m = f = 1. (a) Among the three fixed points 

where ff = 0, the middle one (the open circle) at c = is 
unstable, whereas the other two (the closed circles) are stable. 
In the middle row, c is shown as a function of x (the thick red 
line) for a uniform strategy distribution (the thin blue line). 
It decreases to zero for x < while it increases to one for 
X > 1 resulting in a step function jumping at a; = as 

shown in the bottom row. Hence, P{x) has a sharp peak at 
X = in the steady state, (b) The only stable fixed point 
is at c = The cumulative probability c decreases if it 

is larger than and increases when it is smaller than 
resulting in the c{x) configuration in the bottom row: It is 
zero at a: = 0, one at a; = 1 and c = ^p~'^ elsewhere, regardless 
of the initial configuration. Hence, in the steady state, P{x) 
has two peaks, one at a; = 0 and the other at a; = 1. 


c 

o 

- 4 —< 

O 
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0 0.2 0.4 0.6 0.8 1 
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FIG. 3. (Color online) Fraction of population choosing a pure 
strategy x at time t, obtained by numerical integration of 
the replicator equation with spatial and temporal step sizes 
dx = 10“® and dt = 10“^. The initial distribution is assumed 
uniform so that each x has probability P{x\t = 0)dx = 10 “®. 
(a) The distribution converges toward x = (the dotted 
vertical line) in the winner-take-all case (u = u = 0), while (b) 
Two peaks emerge at a: = 0 with weight and x = 1 with 
weight ip~^ in the loser-pay-all case {u = v = 1 ). Although 
the simulation is a discretized version of the replicator equa¬ 
tion due to the nature of numerical simulation, the results 
nicely confirm the analytic solutions for continuous strategy 
space. 


The symmetry implies (La) = j again, so Eq. (150)) re¬ 
duces to the following: 

f)c 1 

- = --c(c-l)(c2-3c+l). (31) 

The RHS vanishes at c = 0,1, or = 1 — = 

(3 - Vb)/2 ^ 0.382, werein only c = ip ^ is the stable 
fixed point [see Fig.[2][b)]. This shows that the replicator 
dynamics reproduces the common mixed-strategy Nash 
equilibrium at u = = 1 on the population level. 


C. General payoff systems 

For general u and v, the same averaging process as 
above yields the following equation: 

= ^c(c-l)n(c;M,u), (32) 


where n(c; u,v) = {u — v — l)c^ -I- (it -I- 3t; — l)c -I- (u — 
3t; -b 1). The main questions are the location of c* such 
that n(c*; M, i;) = 0 and its stability. From 1 — u + v ^ 0, 
it is straightforward to find 


c*±{u,v) 


— (1 — u — 3u) ± y^D{u,v) 
2(1 — u + v) 


(33) 


withll('u,'u) = (1 —u—3u)^-|-4(l —'u-|-z;)(u—3'y-l-l). Note 
that —>■ ip~^ as {u,v) —( 0 , 0 ), whereas cl — ip~‘^ as 
(u^v) —> (Ijl)- It is worth mentioning that = 1 if 
V = 3u — 1, while cl = 0 if u = 3v — 1, because these 
two lines determine the shape of the phase diagram as 
depicted in Fig. [Il[a). It is interesting that the phase 
diagram suggests duality under reflection across u + v = 
1 , and the replicator equation is actually covariant under 
the transformation (tt, v,c,t) ^ (1 — u, 1 — u, 1 — c, —t). 

If "u > 3u—1, which corresponds to region I in Fig.ITJa), 
only c^(u, v) in Eq. (155)) lies inside the unit interval [0,1] 
as an unstable fixed point. On the other hand, in region 
III where v > ^(u -I- 1), only c*_{u^v) is found inside the 
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FIG. 4. (Color online) (a) Cnmulative c and (b) the cor¬ 
responding fraction P as a function of x at three different 
values of t (with u = v = 0), provided that we start from 
an initial distribution P{x;t = 0) = jZ with a normal¬ 
ization constant Z = (1 — e~^)/4. The curves are obtained 
by numerical integration of replicator dynamics in the same 
way as explained in Fig. [3] The vertical dotted lines represent 
X = X ~ 0.233, and the horizontal one does c = (see the 
text). 




interval and it is stable. In the rest of the possible region 
of u > w, neither of Cj- is feasible, and c = 0 appears as 
a stable fixed point. This implies that P{x) evolves to 
(5(1 — i) in the long run if the second-place prize is worth 
enough, which means that all players do their best in the 
semifinal to advance to the final in region II. 

Furthermore, let us check the population average of 
the strategies in the long-time limit, written as {x) = 
f dx [xP{x;t —>■ oo)]. It is simply 1 over region II, where 
the distribution is driven to a, 6 peak at a; = 1. In 
region III, the average is found to be 1 — c*_, which is 
larger than or equal to where the equality holds at 
(m, v) = (1,1). In region I, (x) is dependent on the initial 
condition in general. However, if we start from a uniform 
distribution P(x, t = 0) = I, we see that (x) = d^, which 
is again greater than or equal to and the equality 
holds at {u,v) = ( 0 , 0 ). 


D. Numerical calculation 

To provide an intuitive example, we perform a nu¬ 
merical simulation of the replicator dynamics for two 
representative cases: the winner-take-all (u = v = 0) 
and loser-pay-all {u = v = 1) cases with /(Ax) given 
by Eq. m- Figure 3 shows the fraction inside the 
population choosing a pure strategy x at time t. We 


assume the initial distribution to be uniform such as 
P(x; t = Q)dx = 10 “^ because we assume a finite res¬ 
olution dx = 10“^ for the simulation. The population 
converges to x = in the winner-take-all case. On the 
other hand, the loser-pay-all case shows two peaks, one 
at X = 0 and the other at x = 1 . 

Another important piece of information is how quickly 
the Nash equilibrium is approached in this dynamics. To 
answer this question, let us linearize Eq. at each 
fixed point as follows: 


dc 

m 


— {c—ci)/ts at Cl = 0 
+ (c- C 2 )/tu at C 2 = (\/5 - l)/2 (34) 

-(c - C 3 )/Ts at C 3 = I, 


where Ts = 4 is a time scale to approach one of the stable 
fixed points ci and C 3 , and = 4/(5 — 2\/b) ~ 7.578 is 
another time scale to get away from the unstable fixed 
point C 2 . The stable fixed points lie at a distance of 
0{e~^) from C 2 , so the overall time scale for convergence 
can be estimated as t ~ -|- ~ 0 ( 10 ), which is con¬ 

sistent with Eig. [21 This argument gives the same r for 
the loser-pay-all case when it applies to Eq. (ED). 

Suppose that we instead take a nonuniform distribu¬ 
tion P(x;f = 0 ) = with a normalization con¬ 

stant Z = (1 — e“^)/4 as the initial condition. Based 
on the discussion in the last paragraph of Sec. IIV A[ 
we find that the cumulative fraction c equals (p~^ at 
i = ln{2eV[(3 - ^)e^ -b \/5 - l)]}/4 a; 0.233. This 
point remains invariant as an unstable fixed point when 
t increases, whereas c = 0 and c = I are stable [Fig.lDa)]. 
As a consequence, P{x) develops a peak at x as shown 
in Eig.jTKb). One can readily generalize this result to an 
arbitrary initial distribution whose support is the unit 
interval [ 0 , 1 ]. 

We may also consider a finite population consisting of 
N individuals evolving with the Moran process (see, e.g., 
Ref. [13 for a review). In this process, we choose an in¬ 
dividual for reproduction with probability proportional 
to the payoff, which is readily calculated from the distri¬ 
bution of strategies. We then randomly choose an indi¬ 
vidual for death with equal probability, regardless of the 
payoff, and it can be the same individual that we have 
chosen for reproduction. The former individual makes 
a copy to replace the latter individual chosen for death 
and such an update is repeated N times during a sin¬ 
gle time step to give an equal chance to everyone. For 
computational convenience, we assume that a strategy 
has finite resolution dx = 10“^. Figure [5] shows typical 
results when initial strategies are sampled from a uni¬ 
form random distribution. When the population size is 
large enough, we observe convergence with a time scale of 
t ~ 0 ( 10 ) time steps for both the winner-take-all and the 
loser-pay-all cases. The behavior of a small population 
is more complicated by the discreteness and deserves a 
systematic investigation in a future study. 
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FIG. 5. (Color online) Cumulative distribution resulting 
from the Moran process for a finite population of N indi¬ 
viduals. We consider N = 10^ and 10^, and assume finite 
resolution of dx = 10“® in defining strategy x. Each curve 
represents a single run with an initial condition sampled from 
a uniform random distribution of x. (a) In the winner-take- 
all case (u = V = 0 and t — 10), the cumulative distribution 
develops a jump near x « 0.6 for N = 10^, implying a sin¬ 
gle peak in the strategy distribution around x « 0.6, after 
0(10) time steps, (b) For u = v = 1, N = 10"*, and t = 20, 
the behavior is again consistent with the result of replicator 
dynamics in that we find a plateau of height « 0.4, when 
t 0(10) time steps. 


V. SUMMARY AND DISCUSSION 


We have proposed a simple example of distributing a 
finite amount of resources in competitions. Despite the 
complexity of the decision problem among four players, 
we have found two important keywords, i.e., balance and 
concentration, which can serve as a practical guide to 
strategic thinking. There are two representative cases 
for these keywords, i.e., u = v = 0 and u = v = 1. The 
former, in particular, has infinitely many Nash equilib¬ 
ria, which are characterized by any common strategy for 
all the semifinalists. The coordination would be achiev¬ 
able in the presence of c heap talk, as in the case of a 
usual coordination game [^. In the latter, the situa¬ 
tion is similar to the chicken game, because it is better 
to give in if your opponent really goes for broke to win 
the semifinal, as illustrated by Siq. The mixed-strategy 
Nash equilibrium is such that you stake all at the semifi¬ 
nals with probability « 0.618 and give up the game 
with = 1 — (f~^ « 0.382. The emergence of the 
golden ratio is fascinating, because it is the most impor¬ 
tant ratio of division in mathematics, also known as the 
most irrational number due to its slowest convergence in 


the continued fraction . We have also shown that the 
mixed-strategy Nash equilibrium is the only symmetric 
solution when we use the Heaviside step function to de¬ 
termine the probability of winning. 

We have also investigated the problem from an evolu¬ 
tionary point of view by introducing the replicator dy¬ 
namics. As mentioned above, a number of different Nash 
equilibria exist for some payoff structures. The replica¬ 
tor dynamics selects one of them depending on the ini¬ 
tial population distribution. Those Nash equilibria can 
be accessed via a certain learning process, i.e., updating 
the strategy based on its performance. For the winner- 
take-all case, the replicator dynamics converges to a pure 
strategy characterized by the golden ratio again, i.e., at 
position X where the cumulative probability is equal to 
(p~^. On the other hand, for the loser-pay-all case, two 
peaks emerge, one at a; = 0 with weight and the 
other at X = 1 with weight ip~^. The replicator dynamics 
result not only provides an alternative derivation for the 
Nash equilibria on the population level, but also sheds 
light on the duality behind the (u, v) phase diagram in 
the limit of small F. 

If we take the Sochi 2014 Olympics as an example 
of (u,f), the relative price of the raw material to pro¬ 
duce a silver medal compared with that of a gold medal 
roughly corresponds to rt Ri 0.6 and that of a bronze 
medal amounts to v < 1%. This parameter set belongs 
to region II, where everyone cares only about the semifi¬ 
nals in the long term. However, this only proves that the 
raw material price is a poor measure of assessing the true 
values of the medals, because the finals in the Olympics 
have remained thrilling throughout the century. 

From a little different viewpoint, our work suggests 
how to design an incentive system to affect the behav¬ 
ior of individuals under structured competition: We can 
imagine members of an organization who compete to win 
a position in the hierarchy with a limited amount of re¬ 
sources. If only the top position is rewarded in effect, for 
example, it will signal to the members that the organi¬ 
zation favors generalists rather than specialists, making 
them conservative in investing effort into specific tasks. 
They will even experience a social dilemma when the 
induced behavioral characteristics contradict the organi¬ 
zation’s goals. As mentioned above, our life is shaped to 
a great extent by a series of competitions in an organized 
society. In this respect, our tournament model will serve 
as a starting point to investigate the effects of structured 
competition on our behavior in various contexts. 
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Appendix A: Nash equilibrium for u = v = 1 


We will prove that a strategy profile Sm = 
{mp *, vrip *, TOp*, mp») is a Nash equilibrium for u = v = 1, 

where nip = p5{x) + {l—p)5{\ — x) andp* = 
with hi = h{l) > 0 . 

Suppose that player A spends x in the semifinal, while 
the others have a certain mixed strategy rup. We define 
La{x, Trip, Trip, nip) as the probability that A becomes last 
in the tournament in this situation. It is a product of two 
probabilities Li and L 2 : The former is the probability to 
lose the semifinal with expending x. The latter is a con¬ 
ditional probability to lose the third-place playoff given 
that A has already lost the first match with expending 


X. 


Let us begin with the semifinal between A and B. 
Player B spends all or nothing with probability 1 — p 
and p, so B’s probability to defeat A is given as 


Li=pf{-x) + {l-p)f{l-x). (Al) 


For the third-place playoff, player A has remaining 
stamina \ — x. To calculate L 2 , however, we need to know 
A’s opponent’s characteristics, resulting from the other 
semifinal between C and D. For the semifinal between 
C and D, there are three possibilities in their strate¬ 
gic choices: (i) With probability p^, both use the strat¬ 
egy of a; = 0 ; (ii) with probability (1 — p)^, both use 
x = 1 ; (hi) with probability 2 p(l — p), one uses x = 0 
and the other uses x = 1. For the case (hi), the prob¬ 
ability that the one with x = 1 to lose the semifinal is 
/(—I). Therefore, the probability that the loser of the 
semifinal between C and D has used up its total stamina 
is (1 — p)^ -I- 2p(l — p)/(—1) = q. In other words, this is 
the probability for player A to meet an opponent with no 
remaining stamina. The idea is that player A effectively 
experiences its opponent’s strategy as rUg. That is, 

L 2 = gf(x - 1) + (1 - g)f(x), (A2) 

whereby we obtain LA{x,mp,mp,mp). Some algebra 
shows that La{0, m-p, nip, nip) = La{1, , nip, nip, nip) = | 
when p = p*. For 0 < x < 1, we plug the explicit expres¬ 
sion of p* into La{x, nip*, nip*, nip*) and find that 

Ah\LA{x, nip*, nip*, nip *) = [/ii -|- (a -|- b)G — (a — b)hi] 
X [hi -I- (a -I- b)G -|- (a — b)hi], (A3) 

where G = ^/l + hf — 1, a = h{l — x), and b = h{x). 
Noting that G^ = h\ — 2G, we expand the RHS as 

[hi + {a + b)GY - [{a - b)hiY (A4) 

= h\ + 2{a + b)hiG + {a + b)^G^ -(a- b)^hl (A5) 
= hl+ Aabh\ + 2{a + b){hi — a — b)G. (A 6 ) 


This expression is invariant under the exchange between 
a and b, which allows us to assume that a > b without 
loss of generality. As hi > a = h{l — x), putting a in 
place of hi results in the following inequality: 

4hiLyi(x, nip*, nip*, nip*) > hf + Aabh\ — 2{a + b)bG. 

(A7) 

We now replace 6 by a in the last set of parentheses to 
get 

Ah\LA{x,nip*,nip*,nip*) > h\ + Aabh\ — AabG 

= h\+Aab{hl-G). (A8) 

Finally, we use the fact that h\—G = \-\-h\— 1 + h^ > 0 

to derive 

AhfLA{x,nip*,nip*,mp*) > h\, (A9) 

which implies LA{x,nip* ,mp* ,nip*) > j. 

Due to the symmetry among the players, it is ob¬ 
vious that LA{nip* ,nip* ,nip* ,nip*) = j. Because 
LA{x,nrip*,nrip*,nip*) > LA{nip*,nip*,nip*,nrip*) for any 
X € (0,1), player A has no reason to choose such x. 
This argument tells us that A’s best choice must be a 
certain mixed strategy rUp = pS(x) -I- (1 — p)S(l — x). 
However, we have already seen that this links A’s pay¬ 
off to j irrespective of p, because La( 0, nip*, nip*, nip *) = 
La{ 1, nip*, nip*, nip*) = \- For this reason, player A may 
adopt nip* as well and we conclude that the strategy pro¬ 
file Sm = (ixip*, nip*, nip*, nip *) is a Nash equilibrium. 


Appendix B: Mixed-strategy Nash equilibrium for 
general u and v 


We will extend the above conclusion to the interior of 
region III by proving that Sm with p*(u,v) in Eq. (EZD 
is a Nash equilibrium. It is con venient to define k = 
1 — u + v, I = 1 — u — V, and z = ijk'^hf + P — 2k'^ + 2k. 
As above, we also have hi = h{l) which is assumed to 
be 0 < hi < i. The interior of region III is described by 
k + I > 0 and 1 + Ihi < k < 1 [compare with Eq. (IT4l) ]. 
Note also that k is strictly positive. In terms of these 
variables, Eq. (HID can be rewritten as 


p*{u,v) 


1 I + z 

2 2khi 


(Bl) 


The outline of the proof is similar to the above one 
for u = x = 1: Suppose that player A spends x at the 
semifinal, while all the others use nip. The probability 
for A to defeat B is 


Wi=pf{x) + {l-p)f{x-l). (B2) 

Regarding the final, we first consider strategies in the 
semifinal between C and D and calculate the probabil¬ 
ity r that A’s opponent in the final has no remaining 
stamina. It is given as 

r = 2 p(l -p)/(I) -k (1 -p)^ 


(B3) 
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Therefore, the probability for A to win the final is thus 
W 2 = rf(l - x) + (1 - r)f(-x). (B4) 

It is straightforward to write down ^’s payoff as 
■jTA{x,mp,mp,mp) = WiW2+uWi{l-W2)+vLi{l-L 2 ), 

(B5) 

where Li and L 2 are as defined above. Due to symmetry, 
we see that 

TTA{mp, Trip, Trip, nip) = -(1 + u + u) = -(2 - Z). (B 6 ) 

If we require TrA{0,mp,mp,mp) = 7 rA(l, rup, rup, rup) = 
■|(1 + M + ?;), it is satisfied only at p*{u, v). 

Now we have to show that 

TTA{x,mp»,Tnp* ,mp*) < ^{1 + u +v) (B7) 

for X G ( 0 , 1 ), from which it follows that nip* constitutes 
a Nash equilibrium. We obtain an explicit expression 
of the left-hand side by inserting p*{u,v) into Eq. (IB5I) . 
which is written as 

TTAix,mp* ^rup^ ,mp*) = {(2 ~ h)a?'h\ 

+ 2ujih{l — x)[h{l — x) — hi] (B 8 ) 

-I- 2uj2h{x)[h{x) — hi] (B9) 

+'iuj3,h{x)h{l — x)} , (BIO) 


Appendix C: Strategy profile 5'io = (1,0,1,0) at 
(w,u) = (1,1) 


Let us choose the following sigmoid function: 


/(Ax) = 


1 - for Ax > 0, 

igAx/r otherwise. 


(Cl) 


where T is a positive parameter to control the width. It 
will be shown that S'lo = (1,0,1,0) a Nash equilibrium 
with this specific choice. Of course, other combinations 
exchanging players A and B, or C and D, are equivalent. 
To prove that S'lo is a Nash equilibrium, we consider 
the following two strategic configurations from player A’s 
point of view. 

First, suppose that the opponent player B has thrown 
in the towel, i.e., the profile is given as (x, 0,1,0). We 
have to find the value of x that minimizes the prob¬ 
ability La{x) for player A to be last. For La{x) = 
[1 - /(a;)][/(-l)/(x - I) + /(l)/(x)], its derivative is 
readily written in terms of /(x) and Note that the 
function /(x) has the following property under differen¬ 
tiation: 

df{x) ^ , j for x > 0 , 

dx 1 otherwise. 


2 r'- ~ r 

Using this property, we find that 

ALa 


(C2) 


dx 


= r-^[l-/(x)]/(l)[l-2/(x)]. (C3) 


where 

wi = l^{l + z) + {21 + z)k{l — k) + k^{l — k)hi 

LO 2 = l^{l -|- z) -|- {21 + z)k{l — k) — k^{l — k)hi 

wa = P{1 + z) + {21 + z)k{l — k) + k'^lh'l. (Bll) 

In region III, wi and 1 x 2 are positive, whereas wa is neg¬ 

ative. Using hi > h{l — x), we replace h{l — x) in the 
first set of square brackets of Eq. (IB 8 I) by hi. Likewise, 
because hi > h{x), we do the same with h{x) in the sec¬ 
ond set of square brackets in Eq. (IB9() . The result is the 
following inequality: 

'KA{x,nip*,mp*,mp*)< (BI2) 

{Ak^h\)~^\{2 — l)k^hl + 4a;a/i(x)/i(I — x)]. 

Clearly, the last term in the square brackets of Eq. ()B13ll 
is negative as long as /i(x) > 0 for x > 0, which leads us 
to the conclusion that 

tta{x, rup *, rup *, nip*) < (4fc^h^)“^(2 — l)k'^h\ 

= 1 ( 2 - 0 . 


The above expression cannot be positive because ^ < 
/(x) < I, which implies that player A is motivated to 
have xa = 1 to minimize La- This choice is reasonable 
because A must win this match with strong possibility at 
any cost. 

Second, as an opposite situation, suppose that player B 
stakes all on a single throw, i.e., where the profile is given 
as (x, 1,1,0). If every other player is concentrating all the 
efforts on either this round or the next one, player A has 
no reason to play by halves because such a strategy would 
always put A in a weaker position than the opponent. 
This statement is mathematically expressed by < 0, 
where La{x) = [1 - /(x - l)][/(-I)/(x - 1) -h /(^l)/(x)] 
is the probability for player A to be last. The inequality 
implies that even if the first derivative vanishes, it 
will be a maximum of L^, so the minima should be found 
at the boundary points, i.e., either at x = 0 or at x = 
1. Direct calculation shows that La{x = 0) — La{x = 
1 ) = 3 [/(I) - 1 ][ 2 /( 1 ) - 1]2 < 0 for any possible /(I) 
between i and I. In short, the minimum of La appears 
at X = 0. This means that player A should prepare for 
the next match avoiding all-out war this time to reduce 
the probability to be last. 
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